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Background of the Invention 



i. 



Field of the Invention 



10 



[0001] The invention relates to the programming of portable data carriers and to the 
execution of programs by portable data carriers. A portable data carrier within the meaning 
of the present document may be, in particular, a chip card (smart card), in various designs, or 
a chip module. 



2. Description of the Related Art 

[0002] Portable data carriers, as they are in common use today, have a processor core and 
a plurality of memories produced in different technologies. In a typical configuration, for 

20 example, a mask-programmed ROM, an electrically erasable and programmable EEPROM 
and a writeable RAM are provided. The RAM serves as a working memory during the 
running of the program, while the program code to be executed by the processor core can be 
stored in the ROM and/or in the EEPROM. These and similar designs of data carriers are 
described in section 3.4 of the book "Handbuch der Chipkarten" by W. Rankl and W. Effing, 

25 Hanser Verlag, third edition 1999. 



15 



[0003] Typically, a memory cell in the EEPROM occupies approximately four times the 
chip area of a ROM memory cell. In order to save chip area, or to achieve greater available 
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memory capacity with the same area, it is therefore desirable for the executable program code 
to be accommodated as extensively as possible in the ROM. However, it is necessary that the 
content of the mask-programmed ROM is unalterably defined for large numbers of data 
carriers as early as during the production stage of the mask-programmed ROM. Writing into 
5 the EEPROM, by contrast, is performed only upon the completion and initialisation of a 
series of data carriers, or when the individual data carriers are personalised. Due to the 
greater flexibility, therefore, it is advantageous for the executable program code to be stored 
as extensively as possible in the EEPROM. This applies both to the programming of smaller 
production volumes of data carriers and to the correction of faults and the introduction of 
10 additional functions in the case of large-volume production. 
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Summary of the Invention 

[0004] In the programming of portable carriers, there is therefore the problem of, on the 
one hand, using the mask-programmed ROM or a comparable memory as extensively as 
5 possible and, on the other hand, achieving as great a flexibility as possible for program 
changes and/or for the production of data carriers in smaller production volumes. 

[0005] According to the invention, the above problem is solved, wholly or partially, by a 
method for optimizing compiler-generated program code intended for a portable data carrier 

10 having both a processor core and a first and second memory area, the first memory area being 
provided to receive the optimized program code, the second memory area being provided to 
receive a predefined library having a multiplicity of library code fragments, and the compiler- 
generated program code being searched for program code fragments which correspond, at 
least in respect of their effect, to respectively one library code fragment, the program code 

15 fragments found thereby being replaced by respectively one call of the corresponding library 
code fragment. 

[0006] Further according to the invention, the above problem is solved, wholly or 
partially, by a computer program product comprising program instructions for a general- 

20 purpose computer which cause the general-purpose computer to optimize compiler-generated 
program code intended for a portable data carrier having both a processor core and a first and 
second memory area, the first memory area being provided to receive the optimized program 
code, the second memory area being provided to receive a predefined library having a 
multiplicity of library code fragments, and the compiler-generated program code being 

25 searched for program code fragments which correspond, at least in respect of their effect, to 
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respectively one library code fragment, the program code fragments found thereby being 
replaced by respectively one call of the corresponding library code fragment. 

[0007] Further according to the invention, the above problem is solved, wholly or 
5 partially, by a portable data carrier having a processor core, a first memory area and a second 
memory area, there being contained in the first memory area optimized program code, and 
there being contained in the second memory area a library which is predefined independently 
of the optimized program code and has a multiplicity of library code fragments, wherein the 
optimized program code has been obtained from compiler-generated program code by 
10 searching for program code fragments which correspond, at least in respect of their effect, to 
respectively one library code fragment, the program code fragments found thereby being 
replaced by respectively one call of the corresponding library code fragment. 

[0008] The sequence in which the steps are itemised in the claims relating to the method 
15 is not to be understood as a limitation of the extent of protection. Rather, developments of 
the invention are provided for in which these steps are performed in a different sequence, or 
wholly or partially in parallel, or wholly or partially interleaved with one another. 

[0009] The invention proceeds from the basic idea of using a predefined library 
20 containing a multiplicity of library code fragments for the purpose of optimizing the program 
code. In the optimization method according to the invention, the program code to be 
optimized, for its part, is searched for program code fragments which correspond in their 
effect or function to respectively one library code fragment. Such program code fragments 
are replaced by respectively one call of the corresponding library code fragment. The 
25 optimized program code is stored in a first memory area of the data carrier (e.g., in the 
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EEPROM), while the library is provided for storage in a second memory area (e.g., in the 
ROM). 

[0010] In tests performed by the inventors, the optimization procedure according to the 
5 invention resulted in a marked reduction of the size of the program code provided for the first 
memory area. This result is unexpected, since one would intuitively assume that, with a 
library of realistic extent, only few parts of the program code would be found to correspond 
to the library code fragments. 

10 [0011] The reduction of the code size brought about by the invention has the result that, 
in the case of a data carrier with a predefined amount of memory, program code for 
additional functions can be included in the first memory area. If the first memory area is 
designed as an EEPROM or in a comparable technology, this program code need only be 
loaded upon the completion or initialisation or personalisation of the data carrier. The 

15 program code which, due to its compactness, implements a multiplicity of functions, can 
therefore be changed or newly written both rapidly and even for small production volumes of 
data carriers, or even for single data carriers. 

[0012] According to the invention, the predefined library is located in the second memory 
20 area, i.e., for example, in the mask-programmed ROM. Normally, the saving of program 
code achieved by the optimization according to the invention is less than the size of the 
library. Even in this case, however, the application of the invention is advantageous, due to 
the better utilisation of the valuable first memory area. If in the compiler-generated program 
code there are many code fragments, groups of which in each case can be replaced by 
25 respectively one single code fragment of the library, and if the library contains only few code 
fragments which are not required, the optimization can result in the program code shrinking 
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by even more than the length of the library. In this case, use of the invention is advantageous 
even if the first and second memory areas are only conceptual sections of one and the same 
physical memory field. 

5 [0013] According to the invention, for the purpose of optimization a search is performed 
for program code fragments, i.e., for sections in the compiler-generated program code which 
can be replaced by corresponding library code fragments. This subsequent optimization 
procedure need not be taken into account by the programmer during program generation; in 
particular, the programmer need not make provision in the program for calls of library 
10 routines. Thus, programming is not in any way rendered more difficult by the invention. 

[0014] In the choice of words used here, the terms "program code" or "code fragment" 
are intended to denote both executable machine code, before or after linkage, and the 
corresponding assembler source code. In other words, in different developments of the 
15 invention the optimization procedure according to the invention can be performed both on the 
basis of the compiler-generated assembler source code and on the basis of the already 
assembled machine code. In the case of the former, the assembling and, if necessary, the 
linkage are performed only after optimization. The library, likewise, can be available during 
optimization as assembler source code and/or as already assembled machine code. 

20 

[0015] In general, a replacement of a program code fragment by a library code fragment 
is possible whenever both code fragments perform mutually corresponding functions. In this 
connection, complex calculations can be performed in respect of the exact effects of code 
fragments in order, for example, to initiate a replacement procedure even if individual 
25 instructions in the code fragments are commuted in an innocuous manner. In particularly 
simple exemplary embodiments, by contrast, a replacement is performed only if the code 
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fragments are identical in respect of the machine code defined by them. Even in the case of 
this simple development, however, a certain analysis of the code fragments is required, due to 
the fact that, for example, a code fragment having a jump with a jump destination which is 
not in the code fragment may not generally be replaced. 

5 

[0016] Additional replacement possibilities ensue if parameterised code fragments are 
used which, in a manner similar to a procedure call, contain one or more parameters (e.g., 
memory addresses or numerical values). 

10 [0017] Preferably, a library code fragment is normally called through a subroutine call 
instruction inserted in the program code. A return instruction, immediately following the 
library code fragment, is then provided in the library. Exceptions from this rule may apply in 
some embodiments if the code fragment to be replaced interferes with the program flow. If, 
for example, the code fragment ends with a subroutine return instruction, the call can 

15 normally be effected by means of a jump instruction. 

[0018] According to the invention, the library used is predefined, i.e., not dependent on 
the program code processed in the current optimization run. In order to achieve the best 
possible optimization results, however, the library is preferably designed so that it contains 
20 appropriate entries for frequently occurring structures of the program code. Such frequently 
occurring code sections may depend, in particular, on the hardware and/or an operating 
system of the data carrier and/or on a compiler used in the generation of the compiler- 
generated program code. 

25 [0019] The computer program product provided according to the invention may be, in 
particular, a computer-readable data carrier such as, for example, an electronic or magnetic or 
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optical memory medium, but it is not limited to physical data carriers. Electrical or optical 
signals (e.g., voltage levels of a communication link) are also to be understood as computer 
program product in the sense used here. The computer program product contains program 
code which executes the optimization steps according to the invention. Preferably, the 
5 computer program product additionally includes a compiler and/or an assembler and/or a 
linker and/or a loader program. 



[0020] The computer program product according to the invention and the portable data 
carrier according to the invention are preferably developed with features which correspond to 
10 the features described above and/or stated in the claims relating to the method. 



WACHP002/RvW 



Patent Application 



Brief Description of the Drawings 



[0021] Further features, objects and advantages of the invention are disclosed by the 
following description of an exemplary embodiment and a plurality of alternative 
embodiments. 

[0022] Reference is made to the schematic drawing, in which the sole figure (Fig. 1) 
shows a representation of a portable data carrier and of different versions of the program code 
in an exemplary embodiment of the invention. 
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Detailed Description of the Exemplary Embodiments 

[0023] The invention is used in the programming of a portable data carrier 10 which, in 
the exemplary embodiment described here, is designed as a chip card. The data carrier 10 
5 contains, in a manner known per se, a semiconductor chip having a processor core 12, a 
mask-programmed ROM 14, an EEPROM 16, a RAM 18, and an interface 20 for contactless 
or contact-bound communication. The said components are connected to one another via a 
bus 22. In alternative embodiments, the three memory fields 14, 16, 18 may be designed in 
other technologies; in particular, FLASH technology may be used for the ROM 14 and/or the 
10 EEPROM 16. 

[0024] A first and a second memory area 24, 26 are conceptually provided in the memory 
fields 14, 16, 18. The first memory area 24 serves to receive the optimized program code in 
the form of executable machine code. A predefined library 28, likewise in the form of 
executable machine code, is stored in the second memory area 26. In the exemplary 
embodiment described here, the first memory area 24 is located in the EEPROM 16, and the 
second memory area 26 is located in the ROM 14. In a manner known per se, the ROM 14 
contains, in addition to the second memory area 26, further, fixedly predefined routines 
which constitute, for example, an operating system of the data carrier 10. The EEPROM 16 
additionally includes a file system for data which are to be stored as non-volatile data in the 
data carrier 10. 

[0025] The library 28 has a multiplicity of predefined library code fragments 30A, 30B, 
30C, which are denoted generally in the following by 30x. In Fig. 1, for reasons of clearer 
25 representation, the library code fragments 30x are shown as assembler source code. 
Normally, each library code fragment 30x is followed immediately by a subroutine return 
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instruction 32 A, 32B, ... (denoted generally in the following by 32x). The subroutine return 
instruction 32x may be omitted, however, if it cannot be reached in the execution of the 
library code fragment 30x due to the fact that, for example, each program flow of the library 
code fragment 30x ends in an exit or in a subroutine return instruction contained in the library 
5 code fragment 3 Ox. 

[0026] The program development for the portable data carrier 10 proceeds from a high- 
level language source code 34, which is represented exemplarily in Fig. 1 in the programming 
language C. The section shown in Fig. 1 waits until the third bit of the input register 

10 ESTPORT out from the unit position attains the value "0", and then sets the output register 
OUTPORT to the hexadecimal value "FF". A compiler 36 known per se converts the high- 
level language source code 34 into compiler-generated program code 38, which in Fig. 1 is 
represented in the form of assembler source code for the 6805 instruction set. Other 
instruction sets, respectively in accordance with the processor core 12, are provided for in 

15 alternative embodiments. 

[0027] An optimization program 40 executes the optimization steps that are essential for 
the present exemplary embodiment. The optimization program 40 processes the compiler- 
generated program code 38 and, moreover, accesses information about the library code 

20 fragments 30x contained in the library 28. In different embodiment variants, this information 
can contain, for example, a copy of the library 28 in the assembler source code and/or a copy 
of the library 28 in the executable machine code and/or a specification of the effect of the 
individual library code fragments 3 Ox in an appropriate description language. Furthermore, 
additional information such as, for example, indexes or hash tables can be provided in order 

25 to accelerate the search procedures performed by the optimization program 40. 
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[0028] The optimization program 40 identifies program code fragments 42 contained in 
the compiler- generated program code 38 which, in execution by the processor core 12, have a 
function which is identical to that of library code fragments 30x contained in the library 28. 
Used for this purpose in the present exemplary embodiment is a relatively simple procedure, 
5 in which the compiler-generated program code 38 is compared, at assembler source text 
level, with the individual entries in the library 28. With regard to the instructions in short 
form and the address and value information, a textual comparison can be performed in this 
case. Symbolic jump destinations, by contrast, must be converted, prior to comparison, into a 
standardised form or into a numeric relative value. In alternative embodiments, by contrast, 
10 the optimization can be performed on the basis of a compiler-generated program code 38 
already present in the form of assembled machine code. 

[0029] A program code fragment 42 for which a corresponding library code fragment 30x 
has been found in the comparison procedure is replaced, in the optimization procedure, by a 

15 call of this library code fragment 30x. In Fig. 1, for example, the program code fragment 42 
and the library code fragment 3 0B are identical apart from the symbolic designation of the 
jump destination. In the optimized program code 44, therefore, the optimization program 40 
replaces this program code fragment 42 by a call of the library code fragment 30B. In the 
present example, this call is designed as a subroutine call instruction 46. Since, in the present 

20 example, the program code fragment 42 corresponds to a machine code of seven bytes in 
length and the subroutine call instruction 46 requires only three bytes, the memory space 
required for the optimized program code 44 has been substantially reduced by the 
replacement. 

25 [0030] Following completion of the optimization, the optimized program code 44 is 
converted by an assembler 48 into machine code that can be executed by the processor core 
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12. Following a possibly necessary linking operation with further program parts, the code is 
loaded into the first memory area 24 upon completion or initialisation or personalisation of 
the data carrier 10. The library 28 has already been present in the second memory area 26 
since the time at which the chip for the data carrier 10 was produced. The data carrier 10 is 
5 therefore ready for use. The translation, optimization and assembling steps described above 
are performed by a general-purpose computer (not shown in Fig. 1) which executes the 
compiler 36, the optimization program 40 and the assembler 48. 

[0031] When, in the operation of the data carrier 10, the program execution by the 
10 processor core 12 reaches the location of the subroutine call instruction 46 in the first 
memory area 24, the library code fragment 3 0B in the second memory area 26 is executed as 
a subroutine. In their effect, the executed instructions correspond exactly to the program 
code fragment 42 removed during the optimization. Following execution of these 
instructions, the processor core 12 executes a return, triggered by the subroutine return 
15 command 32B, to that instruction in the first memory area 24 which immediately follows the 
subroutine call instruction 46. 

[0032] It must be ensured during the optimization that the program functions are not 
altered. Thus, for example, program code fragments 42 having jump instructions which 

20 might have a jump destination located outside the program code fragment 42 should only be 
replaced following precise analysis. A replacement is allowable if each possible flow of the 
program code fragment 42 ends with an exit or a subroutine return. In such cases, however, 
the corresponding library code fragment 30x is called by means of a normal jump instruction 
rather than by means of a subroutine call instruction. These considerations may also be 

25 included even at the time of generation of the library 28, so that the latter contains only such 
library code fragments 30x that may be used without further constraints. 
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[0033] The library 28 should be of such construction that it provides appropriate library 
code fragments 30x as often as possible and thus offers as may optimization possibilities as 
possible. Thus, for example, the library code fragment 30B of Fig. 1 is matched to the 
5 hardware characteristics of the data carrier 10. If the input bit requested in this library code 
fragment 30B corresponds to a frequently required signal value, it is to be assumed that 
corresponding program code fragments 42 occur time and again in the compiler-generated 
program code 38 even for greatly differing applications of the data carrier 10. Similarly, 
frequent operating system calls can be covered by corresponding library code fragments 3 Ox. 
10 A further source for repeating code fragments in the compiler-generated program code 38 
results from the fact that the code generation in the compiler 36 is performed according to 
certain schemas, and recurring code structures are generated as a consequence. 

[0034] Overall, therefore, it is advantageous, for the purpose of generating the library 28, 
15 to statistically evaluate the program code 38 generated by the compiler 36 for a multiplicity 
of applications designated for the hardware and the operating system of the data carrier 10. 

[0035] The particulars contained in the above description of sample embodiments should 
not be construed as limitations of the scope of the invention, but rather as exemplifications of 
20 preferred embodiments thereof Accordingly, the scope of the invention should be determined 
not by the embodiments illustrated, but by the appended claims and their legal equivalents. 

What is claimed is: 
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